Data Harvesting 2.0: from the Visible to the Invisible Web

Authors

  • Claude Castelluccia
  • Stéphane Grumbach
  • Lukasz Olejnik
Abstract

Personal data are fuelling a fast-emerging industry that transforms them into added value. Harvesting these data is therefore of the utmost importance for the economy. In this paper, we study the flows of personal data at a global level and distinguish countries based on their capacity to harvest data. We establish a cartography of international data channels on the visible and invisible Web. The visible Web is composed of the sites that are available to the general public and are typically indexed by search engines. The invisible Web refers to the tags, Web bugs, pixels and beacons that appear on websites to track and profile users. It is well known that the US dominates the visible Web, with more than 70% of the top 100 sites in the world. We show that this domination is even stronger on the invisible Web. The largest proportion of trackers in most countries is indeed from the US. Apart from the US, two countries exhibit an original strategy. China dominates its visible Web with a majority of local sites, but surprisingly these sites still contain a majority of US trackers. Russia also dominates its visible Web, and is the only country with more local trackers than US ones.
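The comparison the abstract describes (which country's trackers make up the largest share on each country's top sites) can be illustrated with a small tally. The sketch below is not the authors' method or data: the observation pairs are made-up placeholders, and the names `tracker_shares` and `dominant_origin` are hypothetical helpers introduced only for illustration.

```python
from collections import Counter, defaultdict

# Made-up placeholder observations, NOT results from the paper:
# (country whose top sites were crawled, home country of a tracker found there)
observations = [
    ("FR", "US"), ("FR", "US"), ("FR", "FR"),
    ("RU", "RU"), ("RU", "RU"), ("RU", "US"),
    ("CN", "US"), ("CN", "US"), ("CN", "CN"),
]

def tracker_shares(pairs):
    """Return, per site country, the fraction of trackers from each origin country."""
    counts = defaultdict(Counter)
    for site_country, tracker_country in pairs:
        counts[site_country][tracker_country] += 1
    shares = {}
    for site_country, counter in counts.items():
        total = sum(counter.values())
        shares[site_country] = {c: n / total for c, n in counter.items()}
    return shares

def dominant_origin(shares):
    """Return, per site country, the origin country with the largest tracker share."""
    return {sc: max(dist, key=dist.get) for sc, dist in shares.items()}

shares = tracker_shares(observations)
dominant = dominant_origin(shares)
```

With real crawl data in place of the placeholder pairs, a country whose entry in `dominant` equals its own code would be one where local trackers outnumber foreign ones.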


Similar resources

The Visible and Invisible Web: An Analysis of Web Use Based on Max Weber's Ideal-Type Model

Using the Web has become ubiquitous and an indispensable part of scientists’ daily life. Although there are many studies dealing with the use of the Web, few studies have focused on how different user groups including scientists make use of visible and invisible parts of the Web for educational and research purposes. This article first introduces the visible and invisible parts of the Web, and ...


Invisible Phenomena in the Overall Personality of Man, in the Interpretive Study of the Verses 38 and 39 of Haqqah

There is a visible and an invisible element in all creatures. There are also realities in human beings, some of which are visible and most of which are invisible. The preference for the invisible is not limited to quantities but extends to qualities as well. This division is inspired by verses 38 and 39 of Haqqah: Most commentators of the Holy Qur'an believe that the external instances of th...


Familiarity with and Use of Web 2.0 Tools in Library Services by Librarians Working at Iran, Tehran, and Shahid Beheshti Universities of Medical Sciences

Background and Aim: Web 2.0 technology has various usages in libraries all over the world. According to studies, however, it seems that this technology is rarely used in Iranian academic libraries. Therefore, the present study aims to determine the level of familiarity with and use of Web 2.0 tools among librarians working at Iran, Tehran, and Shahid Beheshti Universities of Medical Sciences. ...


Data Extraction using Content-Based Handles

In this paper, we present an approach and a visual tool, called HWrap (Handle Based Wrapper), for creating web wrappers to extract data records from web pages. In our approach, we mainly rely on the visible page content to identify data regions on a web page. Our extraction algorithm is inspired by the way a human user scans the page content for specific data. In particular, we use text fea...


Investigating Dynamic Writing Assessment in a Web 2.0 Asynchronous Collaborative Computer-Mediated Context

This study aims at investigating the effect of dynamic assessment (DA) on L2 writing achievement if applied via blogging as a Web 2.0 tool, as well as examining which pattern of interaction is more conducive to learning in such an environment. The results of the study indicate that using weblogs to provide mediation contributes to the enhancement of the overall writing performance, vocabulary a...




Publication date: 2013